1. Introduction. A common problem faced by an experimenter is one of
comparing several categories or populations. The classical approach to
this problem has been to use a test for homogeneity, i.e., to test whether
all categories (populations) are identical or not. This approach is not
always realistic and it is often inadequate. The inadequacy lies in the
fact that only two decisions, accept or reject the hypothesis, are available
to the experimenter. The experimenter is then faced with the problem of
what to do next, especially if the hypothesis is rejected. These difficulties
and inadequacies may be alleviated by formulating the problem as a multiple
decision problem aimed at selection or ranking (ordering) of the k populations.
This has led to the rapid development of selection and ranking theory during the
last two decades. Many reasonable rules have been proposed, and some desirable
properties of these rules have been studied. However, very little work has
been done to study the optimality of selection procedures, especially in the
subset selection approach. In this paper, we are interested in deriving some
methods to construct optimal subset selection procedures. Some classical
selection rules are obtained as special cases.

Let $\pi_1, \ldots, \pi_k$ represent $k$ ($\ge 2$) populations (treatments) and let
$X_{i1}, \ldots, X_{in_i}$ be $n_i$ independent random observations from $\pi_i$. The quality of
the $i$th population $\pi_i$ is characterized by a real-valued parameter $\theta_i$, usually
unknown. Let $\Omega = \{\theta \mid \theta = (\theta_1, \ldots, \theta_k)\}$ denote the whole parameter space.
Let $\tau_{ij} = \tau_{ij}(\theta)$ be a measure of separation between $\pi_i$ and $\pi_j$. We assume that
there exists a monotonically nonincreasing function $h$ such that $\tau_{ji} = h(\tau_{ij})$.
Let $\Omega_i = \{\theta \mid \tau_{ij} \ge \tau_0,\ j \ne i\}$, $1 \le i \le k$, and $\Omega_0 = \Omega - \Omega'$, where
$\Omega' = \bigcup_{i=1}^{k} \Omega_i$. Thus $\Omega = \bigcup_{i=0}^{k} \Omega_i$ is a partition of $\Omega$. For this problem, we
assume $\tau_0$ and $\tau_{ii}$ as known with $\tau_0 \ge \tau_{ii}$ for all $i$. Let $\tau_i = \min_{j \ne i} \tau_{ij}$,
$1 \le i \le k$. We define $\tau^* = \max_{1 \le i \le k} \tau_i$. The population associated with $\tau^*$
will be called the best population. It should be pointed out that if $\theta \in \Omega_i$,
then $\tau_i \ge \tau_j$ for all $j$, since for $j \ne i$,

$\tau_j \le \tau_{ji} = h(\tau_{ij}) \le h(\tau_0) \le h(\tau_{ii}) = \tau_{ii} \le \tau_0 \le \tau_i$.

Thus if $\theta \in \Omega_i$, then $\pi_i$ is the best population. A selection of a subset containing
the best population is called a correct selection (CS). (Note that in case
of ties any one of the best populations corresponding to $\tau^*$ is "tagged" as the
best population.) To illustrate the above notation, we assume that the
observations are drawn from $\pi_i$, which has a normal distribution with unknown mean
$\theta_i$ ($i = 1, \ldots, k$) and known variance $\sigma^2$. We can define $\tau_{ij} = \theta_i - \theta_j$; then it
can be seen that $\tau_i = \theta_i - \theta_{[k]}$ if $\theta_i < \theta_{[k]}$ and $\tau_i = \theta_i - \theta_{[k-1]}$ if $\theta_i = \theta_{[k]}$,
where $\theta_{[1]} \le \cdots \le \theta_{[k]}$. In this case, $\tau_{ii} = 0$ for all $i$. Thus the population
with the largest mean, $\theta_{[k]}$, is the best. If instead $\tau_{ij} = \theta_j - \theta_i$, then the
population with the smallest mean would be the best. In the above example, we
have $h(t) = -t$, which is a decreasing function.
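
The notation of the normal means illustration can be checked numerically. The following Python sketch assumes $\tau_{ij} = \theta_i - \theta_j$ as in the illustration; the function names `separations` and `best_population` are hypothetical and are introduced here only for this check.

```python
def separations(theta):
    """Return the matrix tau[i][j] = theta[i] - theta[j] (assumed separation measure)."""
    k = len(theta)
    return [[theta[i] - theta[j] for j in range(k)] for i in range(k)]


def best_population(theta):
    """Return (tau_i for each i, tau_star, index of the best population)."""
    tau = separations(theta)
    k = len(theta)
    # tau_i is the smallest separation between population i and any other population.
    tau_i = [min(tau[i][j] for j in range(k) if j != i) for i in range(k)]
    tau_star = max(tau_i)
    # In case of ties, the first population attaining tau_star is tagged as best.
    return tau_i, tau_star, tau_i.index(tau_star)


tau_i, tau_star, best = best_population([1.0, 3.0, 2.0])
print(tau_i, tau_star, best)
```

With means (1, 3, 2), the population with the largest mean has $\tau_i = \theta_{[k]} - \theta_{[k-1]} = 1$, and every other population has $\tau_i = \theta_i - \theta_{[k]} < 0$, in agreement with the formulas above.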

Let the observed sample vector be denoted by $X = (X_1, \ldots, X_k)$, where $X_i$
has components $X_{i1}, \ldots, X_{in_i}$, $i = 1, \ldots, k$. Let $\delta = (\delta_1, \ldots, \delta_k)$ be a selection
procedure, where $\delta_i(x)$ is the probability of selecting $\pi_i$ ($1 \le i \le k$) based on
the observed vector $X = x$ from the $k$ populations. As measures of goodness or
optimality of a selection rule, consider two quantities (cf. Lehmann [1]),
$R(\theta, \delta)$ and $S(\theta, \delta)$. We define $S(\theta, \delta) = P(CS \mid \delta)$ and
$R(\theta, \delta) = \sum_{i=1}^{k} R^{(i)}(\theta, \delta_i)$, where $R^{(i)}(\theta, \delta_i) = P\{\text{Selecting } \pi_i \mid \delta\}$.
Let $S$ be the size of the selected subset. Thus $R(\theta, \delta) = E(S \mid \delta)$. For a
specified $\gamma$ ($0 < \gamma < 1$), we may restrict our attention to the class $\mathcal{D}$ of
all $\delta$ such that

(1)  $S(\theta, \delta) \ge \gamma$ for $\theta \in \Omega'$.

We are interested in constructing an optimal procedure $\delta^0$ in $\mathcal{D}$ which minimizes
the supremum of $R(\theta, \delta)$ over $\Omega$ for all $\delta \in \mathcal{D}$, i.e.,

(2)  $\sup_{\theta \in \Omega} R(\theta, \delta^0) = \min_{\delta \in \mathcal{D}} \sup_{\theta \in \Omega} R(\theta, \delta)$.
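
The two criteria can be estimated by simulation for any given rule. The sketch below is only an illustration under stated assumptions: normal populations with unit variance, a common sample size `n`, the best population taken to be the one with the largest mean, and two trivial rules (`select_all`, `select_largest`) that are not procedures from this paper. All function names are hypothetical.

```python
import random


def estimate_S_and_R(theta, rule, n=10, reps=2000, seed=0):
    """Monte Carlo estimates of S = P(CS) and R = E(subset size) for a rule."""
    rng = random.Random(seed)
    k = len(theta)
    best = max(range(k), key=lambda i: theta[i])  # assumed: largest mean is best
    cs_count = 0
    size_total = 0
    for _ in range(reps):
        # Sample means of n unit-variance normal observations per population.
        xbar = [rng.gauss(t, 1.0 / n ** 0.5) for t in theta]
        selected = rule(xbar)
        cs_count += bool(selected[best])
        size_total += sum(bool(s) for s in selected)
    return cs_count / reps, size_total / reps


def select_all(xbar):
    """Trivial rule: always select every population."""
    return [True] * len(xbar)


def select_largest(xbar):
    """Trivial rule: select only the population(s) with the largest sample mean."""
    m = max(xbar)
    return [x == m for x in xbar]


print(estimate_S_and_R([0.0, 0.0, 1.0], select_all))
print(estimate_S_and_R([0.0, 0.0, 1.0], select_largest))
```

The two trivial rules mark the extremes of the trade-off in (1) and (2): selecting everything guarantees a correct selection at the cost of the largest possible subset, while selecting a single population keeps the subset small but cannot guarantee $S(\theta, \delta) \ge \gamma$ over all of $\Omega'$.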

We restrict attention to those selection procedures which depend upon the
observations only through a sufficient and maximal invariant statistic $Z_{ij}$,
which is defined as follows:

$Z_{ij} = f(X_{i1}, \ldots, X_{in_i};\ X_{j1}, \ldots, X_{jn_j})$.

This $Z_{ij}$ is based on the $n_i$ and $n_j$ observations from $\pi_i$ and $\pi_j$ ($1 \le i, j \le k$),
respectively. It is well known that the distribution of $Z_{ij}$ depends only on
$\tau_{ij}$. For any $i$, let the joint density of $Z_{ij}$, $j \ne i$, be $p_\theta(z_i)$. Let $p_\theta(z_i)$ be
denoted by $p_0(z_i)$ when $\tau_{ij}$ = constant for all $j \ne i$, and by $p_i(z_i)$ when
$\tau_{ij} = \tau_0$ for all $j \ne i$, $1 \le i \le k$. In a given problem the function $f$ is so chosen
as to indicate the measure of separation between the populations in a
reasonable way. In case of the above normal means example, a choice of
$Z_{ij}$ might be $\bar{X}_i - \bar{X}_j$, where $\bar{X}_i = \frac{1}{n_i} \sum_{\ell=1}^{n_i} X_{i\ell}$ and $\bar{X}_j = \frac{1}{n_j} \sum_{\ell=1}^{n_j} X_{j\ell}$. Let $\nu$ be a
$\sigma$-finite measure on $R^{k-1}$.
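
For the normal means choice $Z_{ij} = \bar{X}_i - \bar{X}_j$, the vector $z_i$ on which $\delta_i$ depends can be computed directly from the raw samples. This is a minimal sketch; the helper names `pairwise_statistics` and `z_vector` are hypothetical.

```python
from statistics import fmean


def pairwise_statistics(samples):
    """Return the matrix Z[i][j] = mean(samples[i]) - mean(samples[j])."""
    means = [fmean(s) for s in samples]
    k = len(means)
    return [[means[i] - means[j] for j in range(k)] for i in range(k)]


def z_vector(samples, i):
    """Return z_i = (Z_ij, j != i), a point in (k-1)-dimensional space."""
    row = pairwise_statistics(samples)[i]
    return [z for j, z in enumerate(row) if j != i]


# Unequal sample sizes are allowed; the sample means here are 2, 2 and 1.
samples = [[1.0, 3.0], [2.0, 2.0], [0.0, 1.0, 2.0]]
print(z_vector(samples, 0))
print(z_vector(samples, 2))
```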
Oosterhoff [2] defines a monotone likelihood ratio for a random vector
$X$ with $m$ components as follows: Let $\theta$ be an $m$-dimensional vector of parameters.
A partial ordering of points in $R^m$ is defined by $x_1 < x_2$, $x_i = (x_{i1}, \ldots, x_{im})$,
$i = 1, 2$, meaning that $x_{1j} \le x_{2j}$ for $j = 1, 2, \ldots, m$, and the inequality is
strict for at least one component. The density $f_\theta(x)$ has monotone likelihood
ratio (MLR) if for all $\theta_1 < \theta_2$, $f_{\theta_2}(x)/f_{\theta_1}(x)$ is nondecreasing in $x$.
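
As a concrete instance of this definition, a vector of independent normal components with unit variances and mean vector $\theta$ has likelihood ratio $\exp\{(\theta_2 - \theta_1) \cdot x - (|\theta_2|^2 - |\theta_1|^2)/2\}$, which is nondecreasing in $x$ whenever $\theta_1 < \theta_2$ in the partial ordering. The sketch below checks this numerically; the normal model and the function names are assumptions made for illustration and are not taken from Oosterhoff [2].

```python
import math


def normal_density(x, theta):
    """Joint density of independent N(theta_j, 1) components evaluated at x."""
    return math.prod(
        math.exp(-0.5 * (xj - tj) ** 2) / math.sqrt(2 * math.pi)
        for xj, tj in zip(x, theta)
    )


def likelihood_ratio(x, theta1, theta2):
    """Ratio f_{theta2}(x) / f_{theta1}(x)."""
    return normal_density(x, theta2) / normal_density(x, theta1)


theta1, theta2 = (0.0, 0.0), (1.0, 0.5)        # theta1 < theta2 componentwise
points = [(0.0, 0.0), (1.0, 0.0), (1.0, 2.0)]  # increasing in the partial ordering
ratios = [likelihood_ratio(x, theta1, theta2) for x in points]
print(ratios)
```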

Now we state and prove a theorem which provides a solution to the
restricted minimax problem as stated in (1) and (2) (cf. Lehmann [1]).

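Theorem: Suppose that the density $p_\theta(z)$ has the MLR property. If $R(\theta, \delta^0)$
is maximized at $\tau_{ij} = \tau_{ii}$ = constant, for all $i, j$, where $\delta_i^0$ is given by

(3)  $\delta_i^0(z_i) = \begin{cases} 1 & \text{if } p_i(z_i) > c\, p_0(z_i), \\ \lambda_i & \text{if } p_i(z_i) = c\, p_0(z_i), \\ 0 & \text{if } p_i(z_i) < c\, p_0(z_i), \end{cases}$

such that $c$ ($> 0$) and $\lambda_i$ are determined by $\int \delta_i^0 p_i = \gamma$, $1 \le i \le k$, then
$\delta^0 = (\delta_1^0, \ldots, \delta_k^0)$ minimizes $\sup_{\theta \in \Omega} R(\theta, \delta)$ subject to $\inf_{\theta \in \Omega'} S(\theta, \delta) \ge \gamma$.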


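Proof. For any $\delta \in \mathcal{D}$, $\theta \in \Omega'$ implies $\theta \in \Omega_i$ for some $i$, thus

$S(\theta, \delta) = \int \delta_i(z_i)\, p_\theta(z_i)\, d\nu(z_i) \ge \min_{1 \le i \le k} \inf_{\theta \in \Omega_i} \int \delta_i(z_i)\, p_\theta(z_i)\, d\nu(z_i)$.

We have

$\inf_{\theta \in \Omega'} S(\theta, \delta) = \min_{1 \le i \le k} \inf_{\theta \in \Omega_i} \int \delta_i(z_i)\, p_\theta(z_i)\, d\nu(z_i)$.

Hence for any $\delta \in \mathcal{D}$, $\inf_{\theta \in \Omega_i} \int \delta_i(z_i)\, p_\theta(z_i)\, d\nu(z_i) \ge \gamma$, $1 \le i \le k$, and by the
assumption that $\int \delta_i^0 p_i = \gamma$, it follows that

$\int (\delta_i - \delta_i^0)\, p_i \ge 0$.

By the definition of $\delta_i^0$, the integrand $(\delta_i - \delta_i^0)(p_i - c\, p_0)$ is nowhere positive, so

$\int (\delta_i - \delta_i^0)(p_i - c\, p_0) \le 0$,

which, since $c > 0$, implies

$\int \delta_i\, p_0 \ge \int \delta_i^0\, p_0$.

By the MLR assumption, $\delta_i^0(z_i)$ is nondecreasing in $z_i$, so that we have

$\inf_{\theta \in \Omega'} S(\theta, \delta^0) = \min_{1 \le i \le k} \int \delta_i^0\, p_i = \gamma$.

If $R(\theta, \delta^0)$ is maximized at $\tau_{ij} = \tau_{ii}$ = constant, for all $i, j$, then

$\sup_{\theta \in \Omega} R(\theta, \delta) \ge \sum_{i=1}^{k} \int \delta_i\, p_0 \ge \sum_{i=1}^{k} \int \delta_i^0\, p_0 = \sup_{\theta \in \Omega} R(\theta, \delta^0)$,

which completes the proof.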

Example: Let $\pi_1, \ldots, \pi_k$ be $k$ independent normal populations with means
$\theta_1, \ldots, \theta_k$ and common variance $\sigma^2 = 1$. Our interest is to select a nonempty
subset of the $k$ populations containing the best. The ordered $\theta_i$'s are
denoted by $\theta_{[1]} \le \cdots \le \theta_{[k]}$. It is assumed that there is no prior knowledge
of the correct pairing of the ordered and the unordered $\theta_i$'s. Our goal is
to select a nonempty subset of the $k$ populations so as to include the
population associated with $\theta_{[k]}$.

Let $\bar{X}_i$, $1 \le i \le k$, denote the sample means of independent samples of size
$n$ from these populations. Let the joint likelihood function of $\bar{X}_i$, $i = 1, 2, \ldots, k$, be

$g_\theta(x) = \prod_{i=1}^{k} g_{\theta_i}(x_i)$,

where

$g_{\theta_i}(x_i) = \frac{\sqrt{n}}{\sqrt{2\pi}}\, e^{-\frac{n}{2}(x_i - \theta_i)^2}$, $1 \le i \le k$. Let

$\tau_{ij} = \tau_{ij}(\theta_i, \theta_j) = \theta_i - \theta_j$, $\tau_{ii} = 0$, $\tau_0 = \Delta > 0$ and
$Z_{ij} = \bar{X}_i - \bar{X}_j$, $j \ne i$. Let $z_i = (z_{i1}, \ldots, z_{ik})$ and $\tau_i = (\tau_{i1}, \ldots, \tau_{ik})$.
f(0 0) 

W(0,.s), 

hence, it follows that the supremum of $R(\theta, \delta^0)$ over $\Omega$ occurs at $\theta_1 = \cdots = \theta_k$,
i.e., $\tau_{ij} = \tau_{ii} = 0$. Thus the result of the theorem can be applied.

Note that the above procedure (3) is a rule of the type proposed by
Seal [3] to select a subset containing the population associated with the
largest of the $\theta_i$'s.
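
The explicit form of rule (3) for this example is not reproduced above, so the following is a sketch under stated assumptions rather than the authors' formula. Assuming $Z_i = (\bar{X}_i - \bar{X}_j,\ j \ne i)$ is multivariate normal with common sample size $n$ and unit variance, the ratio $p_i(z_i)/p_0(z_i)$ is increasing in $\sum_{j \ne i} z_{ij}$, so a rule of the form (3) selects $\pi_i$ when $\bar{x}_i$ exceeds the average of the other sample means by at least a constant $d$. Since that difference has mean $\Delta$ and variance $k/(n(k-1))$ when $\tau_{ij} = \Delta$ for all $j \ne i$, the condition $\int \delta_i^0 p_i = \gamma$ gives $d = \Delta - \Phi^{-1}(\gamma)\sqrt{k/(n(k-1))}$. The function names `threshold` and `select_subset` are hypothetical.

```python
from statistics import NormalDist


def threshold(k, n, delta, gamma):
    """Constant d with P(xbar_i - mean of the others >= d) = gamma at tau_ij = delta."""
    sd = (k / (n * (k - 1))) ** 0.5
    return delta - NormalDist().inv_cdf(gamma) * sd


def select_subset(xbar, n, delta, gamma):
    """Indices i with xbar_i - (average of the other sample means) >= d."""
    k = len(xbar)
    d = threshold(k, n, delta, gamma)
    total = sum(xbar)
    chosen = []
    for i, x in enumerate(xbar):
        others = (total - x) / (k - 1)  # average of the remaining k-1 sample means
        if x - others >= d:
            chosen.append(i)
    return chosen


print(threshold(4, 10, 0.5, 0.9))
print(select_subset([0.0, 0.1, 0.2, 1.5], n=10, delta=0.5, gamma=0.9))
```

Note that this sketch does not enforce a nonempty subset: the differences $\bar{x}_i - \frac{1}{k-1}\sum_{j \ne i} \bar{x}_j$ sum to zero, so when $d > 0$ it is possible that no population is selected.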